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Abstract. Recent developments of Baxter algebras have lead to applications 
to combinatorics, number theory and mathematical physics. We relate Baxter 
algebras to Stirling numbers of the first kind and the second kind, partitions 
and multinomial coefficients. This allows us to apply congruences from number 
theory to obtain congruences in Baxter algebras. 



1. Introduction 

Baxter algebra was first introduced by Baxter [2] in 1960 to study the theory of 
fluctuations in probability. Let C be a commutative ring and let A S C be fixed. 
A Baxter C-algebra of weight A is a pair {R, P) in which i? is a C-algebra and 
P : i? — > i? is a C-linear map such that 

(1) P{x)P{y) = P{xP{y)) + P{P{x)y) + \P{xy), Vx, y e R. 

The study of Baxter algebras was continued through 1960s and 1970s ^1 E] 
and has experienced a quite remarkable renaissance in the last few years with 
applications to renormalization theory in quantum field theory llOj . multiple 
zeta values in number theory jl5j and umbral calculus in combinatorics, as well 
as in dendriform algebras and Hopf algebras . We will give connections of 

Baxter algebras to Stirling numbers, multinomial coefhcients and partitions. 

Stirling numbers of the first kind and the second kind were introduced by J. 
Stirling j2()| in 1730 and have been studied in the past by well-known mathemati- 
cians like Euler, Lagrange, Laplace and Cauchy. These numbers play important 
roles in combinatorics, number theory, probability and Statistics. There is a large 
literature on these numbers, as can been seen in the survey article [7j. 

For each positive integer n, let = t{t — 1) ■ ■ ■ {t — n + 1) G Q[t] be the 
descending (falling) factorial. Also define {t)o — 1. Stirling numbers of first kind, 
denoted by s(n, k), and Stirling numbers of the second kind, denoted S{n, k) with 
n, /c S N, are defined to be the coefficients in the expression 



(t)„ = 



fc=0 



and in the expression 



We will be mostly concerned with Stirling numbers of the second kind. So we just 
mention that the two groups of numbers have similar properties and are related by 
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the duality 

oo oo 

(2) ''^^s{n,k)S{k,Tn) ~''^^S{n,k)s{k,m) = Sn^m, n, m S Z, 
k=o k=a 

where Sn.m is the Kronecker delta. 

Stirling numbers of the second kind can be recursively defined by 

(3) S{n + 1, fc + 1) = S{n, k) + [k + l)S{n, fc + 1), n, fc > 
with S'(0, 0) = 1, S{n, 0) = S'(0, k)^Q for n, fc > 1. It follows that 

S{n, 1) = S{n,n) = 1 

for n > 1. 

Stirling numbers of the second kind also have the following equivalent descrip- 
tions: 

(1) S{n, fc) = ■^[A'^t"]i=o, where A is the forward difference operator. 

(2) S{n, fc) is the number of partitions of n objects into k non-empty cells. 

(3) klS{n, fc) is the number of surjective maps from a set with n elements to a 
set with fc elements. 

oo n 

(4) exp[t(e" - 1)] - 5m '^(^' k)t''u''/n\. 

n=0 k=0 

Our first goal is to give an interpretation of Stirling numbers of the first and the 
second kind in terms of Baxter algebras. This is given in Section[21after introducing 
the concepts of free Baxter algebras and mixablc shuffle products. In order to ease 
notations and focus on such connections, wc assume that the weight A of the Baxter 
algebra is 1 in this introduction. We will consider Baxter algebras of any weight A 
in later sections. 

Theorem 1.1. f^Theorem 12. l]! For any Baxter algebra {R, P) of weight 1 and 
integer n>l, we have 

n n 

(4) P(l)" ^fc!S'(n,fc)P'=(l), n!P"(l) = ^s(n,fc)P(l)^ 

fe=i fc=i 

Recall that klS{n, fc) is the number of ways to put n different objects into m 
different non-empty cells. It is also the number of surjective maps from {1, . . . , n} 
to {1, ... , fc}. Further, klS{n, fc) = A'=i"|t=o. 

From the theorem, we formally have 

oo oo n 

gP(i)n ^ p(i)»u'7n! = '^('^' k)klP''{l)u''/nl G R[[u]]. 

ri=0 n=0 fc=0 

So e^^"'^''" serves as a generating function of k\S(ji, fc) (compare to item (4) in the 
list before the theorem). We likewise obtain a generating function for multinomial 
coefficients (see Theorem 13. 2|l . 

Stirling numbers of the second kind enjoy many congruences |3I0|H|. The most 
well-known is the congruence 

(5) S{p,k)=0 (modp), 0<fc<p 

for any prime number p. There are several proofs of this property IB] and new 
proofs can be found as recently as 1997 ^H!- As a consequence of l@J and ©, we 
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obtain, for any Baxter algebra of weight A = 1, 

P{lf = P{1) mod p. 
We extend the relation ^ further in Section 13 and show that multinomial co- 
efficients ( " I and partitions also arise naturally in the products of Baxter 

Y ni,--- ,nk J 

algebras (Theorem 13 .21) . This allows us to recover the congruence Q using multi- 
nomial coefficients fProposition 13 . 3|l . 

The congruence (O is equivalent to Fermat's little theorem 

aP = a (mod p), a G Z. 

We apply (01 and (|SJ) in Section0]to generalize this theorem for elements in Baxter 
algebras and obtain 

Theorem 1.2. (^Theorem I4.1|l Let A he a commutative C-algehra and let 
be the free Baxter algebra on A of weight 1. If = a mod p for all 
a ^ A, then = a mod p for all a £ IIIc(A). In particular, taking C = Fp, then 
aP = afor allaGmr^{¥p). 

2. Stirling numbers in Baxter algebras 

We start by reviewing basics on Baxter algebras and the construction of mixed 
shuffle Baxter algebras as a model of free Baxter algebras. For further information 
about Baxter algebras, see |14[I12| and references given there. 

Effective study of Baxter algebras depends on explicit constructions of free Bax- 
ter algebras. This is in analog to the study of commutative algebras benefiting 
from explicit constructions of the free objects (the polynomial algebras). For free 
Baxter algebras we have mixable shuffle algebras. We will need the most general 
construction of mixable shuffle Baxter algebras. So we will summarize this con- 
struction first. More explicit constructions of special cases will be introduced later 
as needed. 

Given a commutative algebra C, A G C as above, and a C-algebra A, the free 
Baxter C-algebra on A is defined to be a Baxter C-algebra {Illc^x{A), P4) together 
with a C-algebra homomorphism jy^ : A —> Ulc.xi^) with the property that, for 
any Baxter C-algebra {R, P) and any C-algebra homomorphism f : A R, there 
is a unique Baxter C-algebra homomorphism / : (UIc >(yl), P4) {R,P) such 
that o / = / as C-algebra homomorphisms. 

One realization of this free Baxter algebra is given by the mixable shuffle Baxter 
algebra. The mixed shuffle Baxter algebra is a pair {lllc,\{^), Pa) (by misuse of 
notations), where UIc,a(^) is a C-algebra in which 

• the C-module structure is given by the direct sum 

00 

^ A(E)c . . .(S>c 4 ; 

n — lactors 

• the multiplication is given by the mixable shuffle product o, recursively 
defined on A®™ x A®" by 

ao o (60 61 ® . . . ® 6„) = ao5o (Xi 61 (g) . . . €5 &„, 
(ao ® ai (g) . . . Om) obo — aobo (g ai . . . ® a„i, a.j, bj G A'^^ — A, 
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and 

(ao (g) ai ® . . . am) o {bo ig) bi ig) . . . ® bn) 

= (aobo) ® ((ai (g) . . . (g) a™) o (1 61 (g) . . . (g) &„)) 
+ (ao6o) g) ((1 (g) ai g) . . . g) a™) o (61 g) . . . g) 6„)) 

(6) +Aao6o g) ((ai g) . . . g) am) o (61 g) . . . g) 6„)) , a^, bj £ A. 
The Baxter operator P4 is defined by 

PA{ai g) . . . g) a,„) = 1 (g oi g) . . . g) a„i, oi g) . . . g) a„i e yl®™, to > 1. 

Since the mixable shuffle product is compatible with the product on A, we will 
suppress the notation o. We will also express C and A from IIIc,a(^) when there 
is not danger of confusion. 
For example, we have 

(flo g) oi g) a2)(6o g) bi) 

= aobo (g ((ai g) a2)(l g) bi) + g) Oi g) 02) + A(ai g) a2)&i) 

= ao6o ® (ai g) (02(1 g) 61) + (fei(l g) 02)) + Aa26i) + foi g) ai g) 02 + aifei g) 02) 

= aobo g) (ai g) 02 g) 61 + ai g) 61 g) 02 + Aoi g) 02^1 + foi g) ai g) 02 + Aai^i g) 02) . 

Theorem 2.1. Let {R,P) be a Baxter C-algebra. For n > 1, 

n n 

(7) P(l)" = ^fc!5(n,fc)A"^''P'=(l), n!P"(l) = ^s(n,fc)A"-'=F(l)^ 

k=l k=l 

Proof. The theorem can be proved using properties of free Baxter algebras. We 
will give a more direct proof by induction. For the first equation, it obviously holds 
for n = 1 since S'(l, 1) = 1. Assume that it is true for n. By ^1 Prop. 6.1], 

(8) P''{1)P{1) = {k + l)P'=+i(l) + fcAP'^(l). 
Then we have 

P(l)"+i =P(1)"P(1) 

n 

= (^fc!5(n,fc)A"-'^P'=(l))P(l) 

k=l 
n 

^ J2 k^-S{n, k)X'-^{{k + l)P'=+i(l) + AfcP'=(l)) 

k=l 

n n 

= + k)X"-''P''+\l) + k\S{n, k)X"+^-^kP\l). 

k=l k=l 

Replacing fc by fc — 1 in the first sum we have 

n+l n 

P(l)"+i = ^fc!S'(n,fc- l)A'^'+i-"P'= +^fc!5(n,fc)A'=+i-"fcP'= 

k=2 k=l 
n 

= S{n, l)\^P{l) + fc!A"+i-'= {S{n, k - I) + kS{n, fc)P'=(l)) 

k=2 

+ {n+iy.S{n,n)P"+\l). 
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So using S{k, 1) = S{k, /c) = 1, fc e N and ©, we obtain 

n 

P(l)"+i = 5(n+l,l)A"P(l) + ^A:!A"+i-'=5(n+l,A:)P'=(l) 

fe=2 

+ l)!S'(n+ l,n+ 1)P("+1)(1) 

n+1 

= ^fc!A"+i-'=5(n + l,/c)P'^-(l)- 

fc=i 

This proves the first equation. The second equation then foUows from the duahty 
in Eq. (|2Jl between Stirhng numbers of the first and the second kind. □ 

It follows from Theorem 12. II and the congruence (0 that 

Corollary 2.2. Let (P,P) be a Baxter C-algebra and let p he a prime number. 

(9) P{lf = Af-ip(l) (mod p) 

A natural question to ask is whether lO holds when 1 is replaced by other 
elements of R. This question will be addressed in Theorem 14.11 

3. Multinomial coefficients and partitions in Baxter algebras 
Let X = {x}. Let 

(ii, • • • ,ik)\is > 0, 1 < s < fc,fc > 1}. 

For / e U, define the norm of / to be |/| = X]s=i *s and define the length of / to be 
£(/) = k. Define / > if > for 1 < s < fc. 
Denote 

and 

X = {x'^^\i e a}. 

Recall that the free Baxter algebra on C[x\ is ©„>i C[x]®" as a C-module, so is a 
free C-module on X together with the mixed shuffle product defined in © . 
The Baxter operator Px on \ilc{X) is defined by Px{x®^) = I® x®^ . Thus 
Px(mc(X)) = l®mc{X) is a free C-module on 1 (g) X := {l ® x®-^ | / e 3}. 

By the Baxter identity Px(UIc(-^)) is closed under multiplication. There- 
fore, for each n > 1, 

(10) (1 = S{n,I)(Sx®^ 

\I\=n 

for elements S{n, I) E C. The first few terms of S{n, I) are given by 

(l(8)x) = l(E)x, 
(l(8)a;)^ = 2(l(g)a:®x)-HA(l(X)x^), 

{li^xf = 6(1 a; 3A(1 a; ® x^) -I- 3A(1 ® ® x) + A^(l (g) x^), 

{1(g) x)^ = 24(1 (g) X (g)x (g) X (g) x) -I- 12A(1 (g) X (g) X (g) x^) + 12A(1 (g) X (g)x^ (g)x) 

+12A(1 x^ g) X g) x) 6A^(1 g) x^ g) x^) + 4A^(1 g) x g) x^) 

-H4A^(1 x^ g) x) A^(l x^). 

Lemma 3.1. S{n,I) ^ only if I > 0. 
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Proof. We prove by induction on n with n — I being obvious. Assuming the lemma 
holds for n. So 

|/|=n,/>0 

For a;®^ — x"^^ ® • • • ® a;**" , the mixable shuffle product © gives 
(1 ® (g) x) = 1 ® (x^'i'- + ■'"-^•^•'"'^ + ■■■+ 

This shows that the lemma holds for n + I. □ 

By Lemma 13.11 the sum in ()10(l is over the set of ordered partitions of n. For 
example, when n = 3, the terms on the right of (1 ® x)^ correspond to the ordered 
partitions 

71=1 + 1 + 1 = 1 + 2 = 2 + 1 = 3. 
For the coefficients in the sum, we have 

Theorem 3.2. (1) For I > 0, we have S{n,I) = f " ) A"-^(^\ 
(2) J2 Sin,I)^k\S{n,k)X"~K 

Recall that the multinomial coefficient ( " ) is the number of ways to 

y il,--- ,ik J 

distribute n distinct objects into k distinct sets Si, ■ ■ ■ , Sk with ii, • • • ,ik objects. 
By Eq. (|10|l and the above theorem, we formally have 

^ ^ ' n=Q n=() \I\=n ^ ' 



Here 



( " I = ( " I for / = (il, • • ■ , Zfe) by abbreviation and (1 ® = 1 by 
convention. Thus 1/(1 — 1 ® a:) can be regarded as a generating function for the 
multinomial coefficients 



Proof. (1). We prove by induction on £{I. It is obviously true for £{I) — 1. Assume 
that it is true for all £{!) with £{!) = n. Then 

J2 5(n+l, J)A"+i-^(-^)(l®a;®^) = (1 ® a;)"+i 

|J|=n+l 

= ^ 5(n,/)(l®a;®^)(l®a;). 

/|=n 

Let / = (il, • • ■ ,ik). Using the notation x^^ = a;*^ (g) • ■ •0a;*'= , the mixable shuffle 
product inj gives 

(1 x'^^){l (g)x) = 1(S) {x^'^'- '^'^'^^ + x^^'^'- + . . . ^ ^mM,- ,ik) 



X 



It follows that, the coefficient of J = (ji, • • • ,jk) with | J| = ji + ■ ■ ■ + jk = n + 1 has 
contribution exactly from It defined by It = (ji, • • • Jt-i,jt - 1, ji+i, • • • , jfc), 1 < 
t < k, with the convention that it — jt — 1 is deleted from It in the event that 
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jt = 1. Then i{It) = 1{J) if jt > 1 and e{It) = £{J) - 1 if jt = 1. For example, 
when J = (2, 1,4), we have h = (1, 1,4), h = (2,4), and h = (2, 1,3). We then 
have 

S{n + 1, ./)A"+i-^(^) = ^ ^(n, /t)A"^^(^') + ^(n, /t)A"-^(^')+i 

jt=i jt>i 

= ^ ^(n, /t)A"+i-^(^) + J2 Sin, /t)A"+i-^(^) 

jt=i jt>i 

k 

= ^5(n,/t)A"+i-^(^). 
t=i 

So by induction wc have 

^(n+l,J)A"+i-^(-') = y( " 1 

Where the last equation is the generalized Pascal identity |17[ p. 24]. 

(2). Once part 1 is proved, part 2 is a standard property on Stirling numbers 
and multinomial coefhcients. We give a direct proof using properties of free Baxter 
algebras instead of part 1. Since III({a;}) is the free Baxter algebra on {x}, there 
is a unique Baxter algebra homomorphism 

/ : m{{x}) ^ m(c) 

induced by f{x) — 1. We have 

(10 1)" = /((l®x)") 

= ^ /(5(n,/)®a:«^) 

\I\=n 

= ^ ^(n,/)®/(a;®^) 

\I\=n 
\I\=n 

= ^^(n,/)®l®^ 

\I\=n 

= ^ 5(n,/)®l«^(^) 

|/|=ri 

\I\=ne(I)=k 

On the other hand, by Theorem 12. II 

n 

(1 ® 1)" = Px(i)" = E ® I)''- 

fc=i 
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Since {1 (g) 1®*^] fc > 1} is linearly independent over C, we obtain 

J2 Sin,I) ^ k\Sin,k)\''-\ 

\I\=n,e(I)=k 

as needed. □ 

We now deduce congruences of S{n,I). As a result, we obtain a proof of the 
classical congruence 

Proposition 3.3. Let p be a prime number. 

(1) ^15(^,7) for any 1^3 with \I\ — p and I ^ {p). 

(2) p\Sip,k) forl<k<p. 

(3) (1 (g) x)P = AP-i ® xP (mod p). 

(4) For any Baxter algebra {R,P) and a E R, we have P{a)P = XP^^P{aP) 
(mod p). 

(5) Let XP~^ — A and let lllf^(¥p) be the free Baxter algebra over ¥p of weight 
A. For a e IIlFp(Fp), we have = A^^^a. 

Part 5 of Proposition 13.31 can be regarded as the Fermat's little theorem for 
Baxter algebra in a special case. We will consider the general case in the next 
section. 

Proof (1). By Theorem E3 S{p,I) ^ ;XP-^. For I ^ (p), we have 

ni\---nk\ 

p I ni! • • • n/c! since Ui < p. The claim follows. 

(2) . From (1) and Theorem O (2), we have p\klS{p, k)XP-'' , 1 < k < p. So we 
just need to take A = 1. 

(3) . This follows from 

(l®x)P= J2 S'(p, /)AP-^(*=^ ® x®^ = AP-i = AP-i ® a;P mod p. 

\I\=p,£{I)=k 

(4) . Since III({x}) is the free Baxter algebra on {x}, the assignment x a induces 
a Baxter algebra homomorphism / : ni({.T}) —^ R. Applying / to the congruence 
in (3) gives (4). 

(5) . We first prove (1®")p — l"*" for n > I with the case of n = 1 being clear. 
Using (4) and induction, we have 

= AP-ip((l®")f) 
= P((l®"f) 

Since {l®"|n > 1} is a basis of IIlFp(Fp) and IIlFj,(Fp) has characteristic p, (5) 
follows. □ 

4. Fermat's little theorem for Baxter algebras 

Now we consider free Baxter algebras in general. Let A be C-algebra. Recall 
that the free Baxter algebra IIIc(A) is 



71>1 
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with multiplication given by the mixable shuffle product and with Baxter operator 
PAiai ® • • ■ a„) = 1 (X) ai ® • • • (X) a„ for ai ® • • • (g) a„ e yl®". An Baxter ideal of 
UIc{A) is defined to be an ideal / of UldA) such that P{I) C /. 

Theorem 4.1. Let p be a prime number. Assume that the weight A of the Baxter 
algebra is invertible. 

(1) forai®---(8)a„G A®", 

(ai • • • ® a„)P = A("-i)(f-i)< • • • ® mod p. 
In particular, if X — I, then 

(oi (g) • ■ • ® GnY = ® ■ • ■ (g) mod p. 

(2) Let I be the Baxter ideal of lIIc{A) generated by p and {a^ — a\ a £ A}. 
Then for a G mc{A), aP = a mod /. 

(3) Let A be a C-algebra of characteristic p such that a^ — a for all a £ A. 
Then — a for all a G IIIc(j4). 

Remark 4.1. (1) Taking a tensor product and taking a power cannot be inter- 
changed in general in IIIc(^)- This is similar to taking a sum and taking 
a power. So (1) can be regarded as a tensor form of the freshman's dream 
(a + by = aP + bP mod p. 
(2) Let A be the ¥p-algebra ¥p[x]/{xP — x), then aP = a for all a £ A. This 
follows from Fermat's little theorem and the freshman's dream. The free 
Baxter algebra IIlFp(A) is a quite large algebra over A, 

mr^{A)=^A^\ 

n>l 

But we still have aP = a for all a £ Tilf^ (A) by the theorem. 

Proof. (1). By (4) of Proposition rOl we have 

{ai (E) ■ ■ ■ (g) a,-,)P = {aiPA{a2 (E) ■ ■ ■ <E) a,i))P 
= aP^PA{a2 <»■■■<» an)P 
= aPXP-^PA{{a2^---®an)P) 
= A^'^af g) (02 <g) • • ■ g) a„)P mod p. 

So by induction, we have 

(ai ® • • • (g a„)f = A(f-i)("-i)af • • • (g mod /. 

(2) . By part 1 we have 

(ai g) • ■ • (g an)P = A("-i)(f-i)af g) • ■ • (g = X^'^-^'^^p-^^ ai g) • ■ • (g a„ mod /. 

Since A is invertible, from A^ = A mod / we have A^""'^ = 1 mod /. (2) follows. 

(3) . This follows from (2), since the ideal / is trivial here. □ 
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